CDS
Accession Number | TCMCG075C03471 |
gbkey | CDS |
Protein Id | XP_017983843.1 |
Location | complement(join(32559197..32559388,32559481..32559797,32559920..32559995,32560092..32560175,32560280..32560966,32561066..32562405,32563031..32563061,32564051..32564143,32564401..32564466)) |
Gene | LOC18613794 |
GeneID | 18613794 |
Organism | Theobroma cacao |
Protein
Length | 961aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA341501 |
db_source | XM_018128354.1 |
Definition | PREDICTED: uncharacterized protein LOC18613794 isoform X1 [Theobroma cacao] |
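The Location and Length fields above can be cross-checked against each other. The following is a minimal Python sketch, assuming the location string follows the GenBank-style `complement(join(start..end,...))` convention with 1-based inclusive coordinates. The helper names `parse_location` and `spliced_length` are illustrative and are not part of any database tooling.

```python
import re

# Location string copied from the CDS record above.
LOCATION = (
    "complement(join(32559197..32559388,32559481..32559797,"
    "32559920..32559995,32560092..32560175,32560280..32560966,"
    "32561066..32562405,32563031..32563061,32564051..32564143,"
    "32564401..32564466))"
)


def parse_location(location: str):
    """Return (is_complement, [(start, end), ...]) for a GenBank-style location.

    Assumes plain numeric coordinates (no fuzzy '<' or '>' boundaries).
    """
    is_complement = location.strip().startswith("complement(")
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return is_complement, segments


def spliced_length(segments):
    """Sum of segment lengths; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


is_complement, segments = parse_location(LOCATION)
cds_nt = spliced_length(segments)   # 2886 nucleotides across 9 segments
protein_aa = cds_nt // 3 - 1        # subtract the stop codon: 961 residues

print(is_complement, len(segments), cds_nt, protein_aa)
```

The spliced length is a multiple of three, and after removing the stop codon it agrees with the 961aa Length field.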
EGGNOG-MAPPER Annotation
COG_category | O |
Description | protein folding |
KEGG_TC | - |
KEGG_Module | M00404 |
KEGG_Reaction | - |
KEGG_rclass | - |
BRITE | ko00000, ko00001, ko00002, ko01000, ko03009, ko03110, ko04131 |
KEGG_ko | ko:K03500, ko:K09508, ko:K09519, ko:K14007, ko:K19370 |
EC | 2.1.1.176 |
KEGG_Pathway | ko04141, map04141 |
GOs | - |
Sequence
CDS: ATGCAGGGCGATGAAGCCAGACTCTTGCTAGGCTTCCCCCCTAATTCTCGCCCTACTCCTTCTCAGGTAAAAGCAGCTTATAGAAAGAAAGTATGGGAGTCGCATCCTGACTTGTTTCCTGTTCACGAAAAACCTAAGGCGGAGTCTAAGTTCAAGTTGATTTTTGAAGCTTATACTTGCCTACAGTCTGAGATGGCCCCACTTCGGTCCACAGGATATGTTGACCCTGGATGGGAGCATGGGATTGCTCAAGATGAAAGGAAAAAGAAGGTTAAATGCAACTACTGTGGGAAAATAGTCAGTGGTGGAATATTCAGATTGAAGCAACATTTAGCCAGATTGTCTGGAGAAGTTACTCACTGTGAAAAGGTTCCTGAAGAAGTATGCTTGAATATGAGAAAGAACCTTGAAGGATGCCGTTCTGGTCGAAAACGAAGGCAATCAGAATATGAACAGGCTGCTCTAAATTTCCAATCTAATGAGTACAATGATGCAGAAGAAGCATCGGCAGGTTATAAACACAAAGGCAAGAAAGTGATGGGTGACAAGAACTTGGTCATCAAGTTTACCCCTCTTCGATCATTAGGATATGTGGACCCGGGATGGGAACATTGCGTTGCTCAAGATGAGAAGAAGAAAAGAGTAAAATGCAACTATTGCGAAAAAATAATAAGTGGGGGCATAAATCGGTTTAAGCAACATCTTGCTAGGATCCCTGGAGAAGTTGCATATTGTGAAAAGGCACCTGAGGAGGTATATCTCAAAATCAAAGAAAATATGAAATGGCACCGTACTGGCAGAAGGCATCGAAAACCTGATACCAAGGAGATATCTGCTTTCTACTTGCACTCAGATAATGAGGATGAAGGTGGAGAGGAGGATGGGTATTTGCAATGTATAAGTAAGGACATACTGGCTATTGACGATAAAGTTTCTGATAGTGACATTAGAAATAATAATGTCAGAGGTAGATCTCCTGGTAGTAGTGGTAATGGTGCTGAACCACTACTTAAAAGATCAAGACTGGATTCGGTATTTTTAAAGTCGCTGAAAAGCCAGACATCAGCACACTACAAACAAACAAGAGCAAAAATAGGTTTCGAGAAGAAAACTCGCAGGGAAGTGATATCTGCTATATGCAAATTCTTTTATCATGCAGGAATCCCTTCTAATGCAGCAAACTCTCCGTACTTCCATAAAATGCTGGAAGTGGTTGGTCAGTATGGGCAGGGTTTGCAAGGTCCTTCAAGTCGAATCATATCTGGTCGTCTCCTTCAGGAAGAGATTGCTAATATTAAAGAGTATCTGGCGGAGTTTAAGGCATCTTGGGCTATTACTGGTTGTTCTGTCATGGCTGACAGTTGGAATGATGCACAAGGAAGGACCCTGATTAACTTTTTGGTCTCTTGTCCTCGCGGTGTTTGTTTTCTCTCTTCTGTTGATGCAACTGATATGATAGAAGATGCTGCTAATCTCTTCAAGTTGTTAGACAAAGCAGTGGATGAGGTTGGCGAGGAATATGTAGTCCAGGTAATCACTAGGAACACTTTGAGTTTCAGGAATGCTGGAAAGATGCTTGAAGAGAAAAGGAGAAATTTATTTTGGACACCATGTGCTGTCTATTGCATTGATAGAATGCTTGAGGATTTTTTGAATATAAAATGGGTGGGAGAATGCATAGATAAAGCAAAAAAGGTGACAAGGTTTATTTATAACAATACCTGGTTGTTGAATTTTATGAAGAAAGAATTTACGAAGGGACAGGAACTTCTTAAGCCAGCTGTCACCAAGTTTGGCACTAATTTTTTCACTTTACAGAGTATGTTGGACCAGAGGGTTGGTCTTAAGAAAATGTTCCAATCAAATCGATGGCTTTCCTCCCGCTTTTCCAAATTAGATGAAGGTAAAGAGGTTGAAAAAATTGTCTTAAATGTCACCTTTTGGAAGAAGATGCAGTATGTGAAGAAATCCTTAGAGCCAGTTGCTGAAGTTCTT
CAAAAGATAGGTAGTGATGAAATCCGATCAATGCCATTTATCTATAATGACATATGTAGAACAAAGCTTGCAATTAAAGCCATTCATGGTGATGATGTGCGCAAATTTGGACCTTTCTGGAGTGTGATTGAAAACAATTGGAGTTCATTGTTCCATCATCCTCTTTATGTTGCTGCATACTTTCTCAATCCATCCTTCCGTTACTGCCCAGATTTTCTGATGAATCCTGAAGTAATTCGTGGTCTAAATGAGTGTATTGTTCGATTGGAGTCAGACAATGGGAAAAGGATTTCTGCATCCATGCAGATACCTGATTTTGTGTCGGCAAAAGCTGATTTTGGAACTGATTTGGCCATAAGTACTAGAAGTGAGCTTGATCCAGCTTCATGGTGGCAACAACATGGGATAAGTTGCTTAGAGCTGCAACGAATCGCCATACGCATACTAAGCCAGAGATGTTCATCGATTGGATGTCAGCATACCTGGAGTGTGTTTGATCAAGTTCACAGCAAAAGACGCAACTGTTTGTCTCGGAAGAGATTGAATGACCACACCTATGTTCATTACAACTTGCGACTGAGAGAACGCCAACTAGGAAGGAAGCCTGATGATTTGGTTTCCTTTGACAGTGCCATGTTAGAAAGTGTATTAGATGACTGGCTTGTGGAGTCAGAGAAGCAAGCCATGCAAGAAGATGAGGAGATTATTTATAATGAGGTGGAACAATTTTATGGAGATGATATGGATGAACATGTGAGTGAAGAAAAGAGACCTACAGAAATGGTCACGTTAGCTAGTTTGGTTGAACCATTGGATGTTAATCCTGCTGCTGGAGGTGTTACCACTGATGATGATGGTCTCGATTTTCTTGATGATGATTTGACGGATTAG |
Protein: MQGDEARLLLGFPPNSRPTPSQVKAAYRKKVWESHPDLFPVHEKPKAESKFKLIFEAYTCLQSEMAPLRSTGYVDPGWEHGIAQDERKKKVKCNYCGKIVSGGIFRLKQHLARLSGEVTHCEKVPEEVCLNMRKNLEGCRSGRKRRQSEYEQAALNFQSNEYNDAEEASAGYKHKGKKVMGDKNLVIKFTPLRSLGYVDPGWEHCVAQDEKKKRVKCNYCEKIISGGINRFKQHLARIPGEVAYCEKAPEEVYLKIKENMKWHRTGRRHRKPDTKEISAFYLHSDNEDEGGEEDGYLQCISKDILAIDDKVSDSDIRNNNVRGRSPGSSGNGAEPLLKRSRLDSVFLKSLKSQTSAHYKQTRAKIGFEKKTRREVISAICKFFYHAGIPSNAANSPYFHKMLEVVGQYGQGLQGPSSRIISGRLLQEEIANIKEYLAEFKASWAITGCSVMADSWNDAQGRTLINFLVSCPRGVCFLSSVDATDMIEDAANLFKLLDKAVDEVGEEYVVQVITRNTLSFRNAGKMLEEKRRNLFWTPCAVYCIDRMLEDFLNIKWVGECIDKAKKVTRFIYNNTWLLNFMKKEFTKGQELLKPAVTKFGTNFFTLQSMLDQRVGLKKMFQSNRWLSSRFSKLDEGKEVEKIVLNVTFWKKMQYVKKSLEPVAEVLQKIGSDEIRSMPFIYNDICRTKLAIKAIHGDDVRKFGPFWSVIENNWSSLFHHPLYVAAYFLNPSFRYCPDFLMNPEVIRGLNECIVRLESDNGKRISASMQIPDFVSAKADFGTDLAISTRSELDPASWWQQHGISCLELQRIAIRILSQRCSSIGCQHTWSVFDQVHSKRRNCLSRKRLNDHTYVHYNLRLRERQLGRKPDDLVSFDSAMLESVLDDWLVESEKQAMQEDEEIIYNEVEQFYGDDMDEHVSEEKRPTEMVTLASLVEPLDVNPAAGGVTTDDDGLDFLDDDLTD |
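
The CDS and Protein entries above can be checked for consistency by translating the nucleotide sequence. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The `translate` helper is illustrative, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumes the input starts in frame; trailing bases that do not fill a
    codon are ignored. Unknown codons are rendered as 'X'.
    """
    cds = cds.upper().replace("U", "T")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS above.
print(translate("ATGCAGGGCGATGAAGCCAGACTCTTGCTA"))  # MQGDEARLLL
# Last 15 nucleotides of the CDS above, ending in the TAG stop codon.
print(translate("GATTTGACGGATTAG"))                 # DLTD
```

Both fragments match the start (`MQGDEARLLL`) and end (`DLTD`) of the Protein entry. Running the same function over the full CDS would be the complete check.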